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A Biased Review of Tau Neutrino Mass Limits 
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To appear in the proceedings of the 6th International Workshop on Tau Lepton Physics, 18-21 
September 2000, Victoria (Canada) 

After a quick review of astrophysically relevant limits, I present a summary of MeV scale tau neutrino mass 
limits derived from accelerator based experiments. I argue that the current published limits appear to be too 
consistent, and that we therefore cannot conclude that the tau neutrino mass limit is as low as usually claimed. 
I provide motivational arguments calling into question the assumed statistical properties of the usual maximum 
likelihood estimators, and provide a prescription for deriving a more robust and understandable mass limit. 



1. Neutrinos and Cosmology 



All particle species, including neutrinos, must 
have been produced during the Big Bang. Should 
neutrinos be stable and sufficiently massive, their 
combined gravitational attraction would be suf- 
ficient to collapse the Universe. Our existence 
therefore allows one to infer properties of invisi- 
ble but gravitationally interacting matter. This 
argument was originally advanced by Gershtein 
and ZePdovich|l] in 1966 and was used by Cowsik 
and McClelland ^| to devise a limit on the order 
of 100(xh)eV for the sum of the masses of stable 
neutrino species, where h is the scaled Hubble 
Constant. This argument also supplies a lower 
bound on extremely heavy neutrinos at the GeV 
scale. Details of these limits can change substan- 
tially if neutrinos decay. The change depends on 
the equation of state of its decay products - a 
massive neutrino decaying to massless particles 
will effect the evolution of the universe differently 
than one with massive decay products. The effect 
also depends on the time scale of the neutrino de- 
cay relative to other time scales in the Universe, 
such as that of Big Bang nucleosynthesis. The 
contribution of decaying neutrinos to supernovae 
energetics can also be used to derive limits on v r 
lifetime and mass || . Shown in Figure [l] are a set 
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Figure 1. Some cosmological and particle physics 
limits on the v T mass and lifetime as per [|J 



of allowed and excluded regions of mass and life- 
time, as supplied by ALEPH ||. In addition to 
these cosmological bounds, the figure also shows 
the limits imposed by the lack of observation of 
neutrino-less decays such as r — * 3/i - these de- 
cays could have massive neutrinos inside of loops 
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and would therefore be sensitive to neutrino prop- 
erties. In addition, the stringent limit claimed 
by ALEPHfl is also plotted in the figure. Note 
that on such a scale, the difference in mass limits 
from different experiments would be barely visi- 
ble. Of interest is that at the MeV mass scale, 
where accelerators are currently sensitive, many 
lines cross - this indicates that this could be a re- 
gion rich with (mis)understanding, where many 
different approximations might break down. 

It would be interesting to review what effect 
recent indications of a non-zero cosmological con- 
stant might have on all these limits. The overlap 
of the Supernova and Boomerang data released 
this year indicate that the gravitational energy 
density of the Universe is to within a factor of 
a few equal to that used to derive the Cowsik- 
McClelland limit and that the ensuing argument 
is still qualitatively valid. One can also recast the 
argument in terms of the stability of clusters, and 
this still yields a comparable limit Q . 

2. Recent Results of Interest 

The DONUT collaboration has recently an- 
nounced that it has made the first observation of 
tau neutrino appearance. Although their results 
(reported elsewhere in these proceedings) do not 
tell us very much about the mass of this neutrino, 
they do tell us that it does exist. 

The KARMEN collaboration, which had pre- 
viously reported an anomaly consistent with a 
weakly interacting neutral particle emitted in 
pion decays with a mass of 33.9 MeV ||, no longer 
sees this anomaly in a larger dataset. Their new 
results are reported elsewhere in these proceed- 
ings. 

Possibly the most talked about recent results 
are those coming from the SuperKamiokande col- 
laboration, also reported elsewhere in these pro- 
ceedings. They report that their data is con- 
sistent with the mixing of muon type neutrinos 
into a different state, most probably r neutri- 
nos. Should these results hold up, they are a 
strong argument against a massive neutrino in 
the allowed MeV mass range. Unfortunately, 
SuperKamiokande has yet to take into account 
the possibility that neutrinos are massive and 



can decay. Given that all other known parti- 
cles that oscillate also decay, one might legiti- 
mately ask whether this is also the case in the 
neutrino sector. One could hope that low mass, 
long lived electron type neutrinos, medium lived 
muon neutrinos and short lived massive tau neu- 
trinos might help explain the solar and atmo- 
spheric neutrino puzzles. Until such possibilities 
are ruled out it might be premature to assume 
that mixing has been proven and that the MeV 
mass scale for the tau neutrino is no longer inter- 
esting. Simply showing consistency with a given 
model does not necessarily rule out other models. 

3. Accelerator Based Non-Endpoint Lim- 
its 

At electron-positron colliders, it is possible to 
produce mono-energetic tau pairs. When these 
decay to hadrons and exactly one neutrino each, 
the tau flight direction on each side must have 
been along a cone around the direction of the 
hadronic decay products. The size of this cone 
depends on the tau and neutrino mass. Requir- 
ing that the common r flight direction is on the 
overlap of the two cones allows one to measure 
rn^ — m„. This method was used by CLEO§, 
resulting in the mass distribution shown in Fig- 
ure ||. The sharp drop near 1.8 GeV provides the 
required mass. Combining this result with the 
BES result for the r massQ, CLEO derives an 
upper limit of 60 MeV on the neutrino mass at 
95% C.L. 

If the tau neutrino were more massive than the 
tau, the tau could not decay: the mass of the neu- 
trino restricts the allowed phase space and decay 
rate. One can set a limit on the neutrino mass by 
carefully measuring branching ratios of the tau. 
The theoretically best predicted decays are the 
fully leptonic decays r — > lv\v T where I = e, [A, 
with the main uncertainties coming from the tau 
mass and lifetime. These are measured at the per 
cent level. Knowing the pion and kaon form fac- 
tors, once can also use the single body hadronic 
decays of the tau. Using the Particle Data Group 
best fit values for all these parameters, Swain and 
Taylor || derives a 95 % confidence level upper 
limit on the tau neutrino mass at 68 MeV. 
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Figure 2. Two Cone Mass for — determi- 
nation from - the location of the sharp drop 
determines the limit. 



4. Accelerator Based Endpoint Limits 

The method that has given the most stringent 
limits on the tau neutrino mass attempts to fit 
the two dimensional spectrum of hadronic de- 
cay product mass and energy for r's produced 
at electron-positron colliders. As shown in Fig- 
ure near the endpoint, the kinematically al- 
lowed contour depends very strongly on the neu- 
trino mass. Not only does the contour depend 
on the neutrino mass, but for any fixed energy, 
the hadronic mass spectrum shape also depends 
on the neutrino mass: points away from the limit 
contain some information on the neutrino mass. 
However, for a fixed hadronic mass, the deriva- 
tive of the hadronic energy with respect to the 
neutrino mass is effectively flat except for a delta 
function at the contour limit[]. Finite smearing 

1 This means that in the limit of a detector with no smear- 
ing, only those points at the contour limit would provide 
information on the neutrino mass in the hadronic energy 
direction, while along the mass axis all points would pro- 
vide neutrino mass information. The mass limit could 
potentially be set by one point with mis-measured energy 
- this problem is far less severe along the mass axis where 
all points contain information. This approximation points 
out that the systematic error estimation due to smearing 
has to be done very carefully. There is also a need for a 
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Figure 3. A typical scaled hadronic energy vs 
mass distribution (from Note the kinemat- 
ically allowed boundaries for different neutrino 
masses and the typical error ellipses around the 
events. 



dilutes further the neutrino mass sensitivity, as 
do initial and final state radiation. Note that 
ISR/FSR cause points to drop along the energy 
axis in the figure, but do not smear them along 
the mass direction. These effects are calculable 
however. 

One can distinguish two different types of de- 
cays used for this kind of analysis with differ- 
ent sensitivities to systematic errors. Decays to 
many pions (e.g. r — > hirv) tend to produce 
events near the sensitive endpoint but suffer from 
small branching ratios and low reconstruction ef- 
ficiencies. Decays with few particles in the final 
state (e.g. r — > "iixv ) tend to produce events far 
away from the endpoint, but have efficiencies and 
branching ratios that are large - one hopes that 
the efficiencies are large enough that a rare event 
near the endpoint will be reconstructed. 

The most general likelihood function is of the 

careful evaluation of the analytic and statistical properties 
of the likelihood function in the presence of an underly- 
ing discontinuous estimation parameter dependence. No 
such examination has been done explicitly in any of the 
neutrino mass literature. 
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form: 

£(M VT )=P(N obs ,M UT ) J[ {a£ Stg nai + {l-a)£BGD) 

Data 

The first term, used only by Jl(J] and fL2| , is a 
Poisson term relating the number of events near 
the endpoint as a function of the number of events 
far from the endpoint and the neutrino mass. One 
uses this term as a compromise to avoid fitting 
over many points which do not have much neu- 
trino mass information, and also lessen the depen- 
dence on the explicit form of the physics function 
in the low mass region. 

The product term is taken over all accepted 
data events, and is composed of a signal term 
^■Signal and a background term Cbgd- The sig- 
nal term is composed of a convolution of the ex- 
pected detector smearing, efficiency and the dif- 
ferential decay width. The neutrino mass enters 
only through the decay width and the limits of in- 
tegration for the convolution - again, the energy 
dependence on neutrino mass in this term comes 
in only as an integration limit, and not a slope. 
The differential decay width depends on the V- 
A nature of the weak interaction, the underlying 
physics of the event and the radiative corrections, 
including ISR and FSR. The background term is 
a parameterization of events which are not signal, 
whether originating from (mis-reconstructed) tau 
decays or other physics. These terms in general 
do not depend on the neutrino mass - this is true 
to a high degree even for the mis-reconstructed 
r events since they tend to be highly smeared 
over the plot. This lack of dependence on neu- 
trino mass makes this type of likelihood divergent 
when all neutrino masses are considered so one 
introduces a prior distribution, cutting off neu- 
trino masses greater than say 100 MeV. The back- 
ground shape is generally determined either from 
Monte Carlo or directly from non-tau decay data. 
The relative weight of signal and background, a, 
is usually determined by fitting for the number of 
events above the endpoint region. 

The likelihood is then used to determine an up- 
per limit on the mass of the neutrino. This is 
generally done by integrating^ the likelihood to a 

2 Note that strictly speaking it is a product of the likcli- 



95th percentile above MeV or by finding where 
the log of the likelihood drops by 1.92 from its 
peak. Neither method is more correct than the 
other, rather each expresses a different philosophy 
on the meaning of the upper limit. In all cases 
published so far the application of either method 
gives a consistent answer to within w 0(5MeV). 

5. Current Results 

Since only one new result has been published 
since the last conference, readers desiring a full 
review of previous results should refer to R. Mc- 
Nulty's review in the previous set of proceedings 
]lT| . These results are briefly summarized in Ta- 
ble 0. The new result is from CLEO|jl| which 
has examined the neutrino mass using the de- 
cay t — > 3irir°v for the first time. This sample 
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Figure 4. Four pion invariant masses for events 
with an uj (top) and events with out an uj (bot- 
tom) for CLEO's r — > Ztttt^v sample. The his- 
togram shows the Monte Carlo expectation. 



is composed of 29,000 decay events, as shown in 
Figure [|. The fit includes an explicit term ac- 
counting function for mis-identified tau decays, 
corresponding to 7% of the event sample. Some 
3% of the sample is accounted for by a non-tau 
background term. The spectral function in this 
decay was fit below the neutrino mass sensitive 

hood function and a to p h at prior distribution which is 
being integrated, as per |13[ 



region to a combination of non-resonant, rho and 
omega subcomponents. This fit was then ex- 
tended into the endpoint region. The resulting 
likelihood distribution is shown in Figure ||. The 
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Figure 5. The neutrino mass likelihood distri- 
bution for CLEO's t —> iim^v sample. Note the 
raw 95th percentile at 22 MeV. 



95th percentile above MeV is at 22 MeV for the 
raw data, and the total systematic error, added in 
linearly to conform with the standard set by pre- 
vious studies, sets the final limit at 28 MeV. The 
main systematic error comes in through the spec- 
tral function dependence, the 7r° energy scale, as 
well as the charged track momentum reconstruc- 
tion. 

6. General Remarks on Published Results 

A glance at Table [l] summarizing all published 
results shows that all limits are generally within 
the same range, in spite of a very large range of 
event sample sizes. 

The typical upper limit in this table is in 
the 30 MeV range. One would naively expect 
that the scatter in the most likely neutrino mass 
for each experiment should be on the order of 
30MeV/1.64 = !8MeV - this is manifestly not 
the case. For the purposes of the following, let 
us imagine that the true neutrino mass is MeV, 
and that each experiment has a resolution of 18 
MeV on the neutrino mass. I believe that 7 of the 
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published results in the table are statistically in- 
dependent, in addition to the unpublished DEL- 
PHI result JD]]. Once can ask what the Gaussian 
probability is that all 7 different published exper- 
iments could all arrive at the result that the most 
likely neutrino mass is between MeV and 18 
MeV - this is obtained from a one tailed Gaussian 
distribution and is equal to 0.64 7 = 4%. If we in- 
clude the unpublished DELPHI97(ll| result and 
ask what the probability is that the most likely 
mass is less than 18 MeV, including negative val- 
ues, the probability is somewhat more promising, 
namely $j * 0.84 7 * 0.16 1 = 38%. In this case 
however, the probability of all the maxima being 
located within ±18M eV of MeV is derived from 
a two-tailed Gaussian as 0.68 8 = 5%. If one ar- 
gues that in fact the typical resolution is larger 
than 18 MeV, then these results are even more 
extreme. These simple considerations lead me to 
conclude that some subtle series of systematic bi- 
ases must be driving down the published neutrino 
mass limitf^. 

7. Subtleties 

How should one quote the neutrino mass limit? 
There is no accepted way to judge the relative 
merits of different limits. The PDG[jll| simply 
quotes the lowest published value for the upper 
limit. This prescription is unfortunately biased 
towards finding a massless neutrino. 

One can attempt to combine the likelihood 
functions of different experiments and thus hope 
to obtain a clear answer. However, should any one 
of those experiments suffer from an unfortunate 
fluctuation due to an undetected systematic error 
towards low m v , the resulting combined distribu- 
tion will also suffer from this. The problem with 
this method is that all of the experimental results 
are considered on an equal footing. One should 
clearly have a mechanism to de-weight less reli- 
able results. Combining limits from different pub- 
lished results is simply quite difficult. Not only 
is it not clear how to incorporate different sys- 

3 One might also argue that the Gaussian approximation 
used in the above is invalid for such small statistics - this 
of course calls into question the meaning and validity of 
the quoted upper limits. 



tematic errors from different experiments prop- 
erly, but the fact that the different measurements 
tend to use slightly different functional forms for 
their likelihood makes a blind multiplication of 
likelihood functions difficult to interpret. 

One might argue that, in the face of similar ex- 
perimental resolutions, the experiment with the 
largest sample should have the most robust result, 
just like in the case of a branching ratio measure- 
ment. Neutrino mass limits differ in some im- 
portant ways from the limits one typically makes 
on branching ratios. A branching ratio limit can 
treat each event as having an equal amount of 
information. However, the information content 
of an event in a neutrino mass limit depends on 
its reconstructed location in the energy vs mass 
plane and on its associated expected smearing, 
as well as the particular spectral function of the 
mode under consideration. This means that the 
large N limit where we are all typically com- 
fortable in interpreting likelihoods as statistically 
meaningful depends on what region of neutrino 
mass one is investigating. "Large N" for a 100 
MeV limit can be much smaller than "large N" for 
a 10 MeV limitQ. In addition, the range of possi- 
ble reconstructed masses for an event depends on 
the value of the true neutrino mass (this is most 
clearly seen for a perfect detector with delta func- 
tion resolution.) An important property of max- 
imum likelihood estimators is that they are effi- 
cient: if an estimate of minimum variance exists 
for a parameter, then the maximum likelihood 
should find it|jl|. However, the derivation of 
this minimum variance bound, known as the Rao- 
Cramer-Frechet bound, requires that the range 
of the underlying event probability distribution 
not depend on the parameter being estimated p^] : 
this is explicitly not true for the neutrino mass 
fits. Thus, with the Rao-Cramer-Frechet bound 
in question, it is not clear that the maximum like- 
lihood estimator can find the estimate with the 
minimum variance^. Of course in the opposite 



4 Similarly "large N" for a mode such as t — > 3nu has to 
be much larger than "Large N" for the mode r — > 5ttu. 

5 One might argue that the presence of smearing, which 
allows events to be reconstructed above the kinematically 
determined endpoint, makes the above argument less com- 
pelling. However, this line of thought leads to the conclu- 
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extreme, if the smearing is large enough, the in- 
formation content of any point tends to zero, and 
the minimum variance bound tends to infinity. 
These observations point out the need for a seri- 
ous evaluation of the form of the likelihood used 
for neutrino mass estimation and its properties. 

One should also keep in mind that showing con- 
sistency with a massless neutrino as one typically 
does by quoting an upper limit is not the same 
as excluding massive neutrinos. One can occa- 
sionally get low fluctuations in the neutrino mass 
limit even for massive neutrinos .0 

Of course one very important consideration 
that must be taken into account is the sociolog- 
ical bias towards massless neutrino results. Ex- 
perimental groups tend not to work as hard to 
publish results that are not more stringent that 
their competitor's. 

8. Upper Limits Are Not What You Think 
They Are 

As an illustration that neutrino mass limits are 
not straightforward, consider Table |2|. This table 
summarizes the number of times one gets an up- 
per limit at 27 MeV or lower using the CLEO 
Monte Carlo for reconstructed data samples of 
25 events and 450 events for two different input 
neutrino masses of MeV and 50 MeV. An ex- 
perimenter is confronted with an experimentally 
derived limit, and from this must venture a guess 
as to what the input (true) neutrino mass might 
have been. With this in mind, with a small sam- 
ple of 25 events one is much more likely to mis- 
takenly accept or reject a massive neutrino than 
one is with a large 450 event sample. Thus with 
similar detectors at similar energies and the same 
decay mode, one is more likely to infer the cor- 
rect conclusion about the neutrino mass using in- 
formation from the largest reconstructed sample 
instead of using the lowest estimated upper limit. 
It is statistics, not lucky events near the endpoint 
that give discriminatory power. It is also inter- 

sion that a perfect detector with no smearing is less useful 
for determining a neutrino mass than one with smearing. 
Such a repugnant thought is not consistent with our un- 
derstanding of how measurements should work. 
6 See hypothesis testing, and Type I and Type II errors 
in any introductory statistics book. 



esting that even with a sample as large as 450 
events, the figure of 95% appears nowhere in this 
table - this indicates that the variance in results 
is still not small. 

9. Exhortations to future limit setters 

In light of the observations made above, I would 
like to counsel future limit setters to heed the 
following advice: 

• Restrict your fit region and use a Poisson 
Term: Since most events far from the end- 
point have little neutrino mass information 
apart from helping to normalize the ex- 
pected number of events near the endpoint, 
it is not worth CPU time to fit these events 
in detail. 

• Use a Background Function: If one wishes 
to avoid using a background function, one 
must use tighter and tighter cuts as the ac- 
cepted luminosity increases. This can result 
in the paradoxical situation in which an ex- 
periment ends up with less sensitivity as its 
sample size grows. The inclusion of a back- 
ground function remedies this strange state 
of affairs. 

• Publish the expected reach < M95 > (and 
its variance) for a massless neutrino hypoth- 
esis. This will help establish whether the 
limit from data is meaningful or a danger- 
ous fluctuation. 

• Publish the discriminatory power of your 
experiment: how often does your data de- 
rived limit occur for a large (say 50 MeV) 
neutrino mass? 

• Compare the limit derived from mass in- 
formation only to the limit derived from 
the energy vs mass distribution: if the two 
dimensional limit is substantially different 
from the one dimensional one, then the limit 
is being driven by a small number of events 
or a very unlikely event distribution. 

• Carefully examine large smearing tails: 
comments given in the text point out that 



8 



Table 2 

The probability of obtaining a 95th percentile mass limit of 27 MeV or lower for different input neutrino 



masses 


and event sample sizes. 






25 Events 


450 Events 


OMeV 


3% 


67% 


50 MeV 


1% 


« 1% 



The values here are calculated using the CLEO Monte Carlo and detector simulation for r — > 57w decays. 
Exact values will vary from experiment to experiment, and decay mode to decay mode. 



the sharp cut-off in reconstructed energy as 
determined by the neutrino mass make the 
properties of the neutrino mass estimator 
unclear. At the very least, one should care- 
fully examine the effects of systematic error 
along the energy axis. Ultimately, as sam- 
ples grow and the relevant fit region shrinks, 
fluctuations due to the unknown tails of the 
smearing functions in both mass and energy 
must dominate the fits. 

10. How to Set a Limit We Can All Un- 
derstand 

In light of the suspicious properties of the max- 
imum likelihood estimator raised in this review, 
it would be very useful if experimenters would 
calibrate their estimators using a Monte Carlo 
method. The method proposed herein answers 
the question of "What masses are ruled out by 
my data ?" 

• Generate a large number of Monte Carlo 
samples with sizes comparable to those ob- 
tained in data for different input neutrino 

masses ( m 1 ™) 

• For each of these samples, form the like- 
lihood, and find the neutrino mass which 
maximizes the likelihood, m^ c 

• Plot rhf c vs m™ 

• Form the likelihood for the data sample and 
find its maximum rh Data 

• Find m™ such that rhff c is larger than 
m Data 95% of the time 

• Set the upper limit to this value 



Unfortunately this method does not provide a 
nice way to combine limits from different exper- 
iments. However given the points made in this 
review, perhaps it is better to suffer from this 
than to combine likelihoods with unclear statisti- 
cal properties, and possibly incompatible mean- 
ings. 

11. Conclusions 

I have briefly reviewed the current constraints 
on an MeV scale tau neutrino mass. These con- 
straints are from astrophysical observations as 
well as from terrestrial observations and all of 
them taken at face value allow for the existence of 
an unstable v T with a mass on an MeV scale. Re- 
cent SuperKamiokande results indicate that the 
(stable) neutrinos participating in atmospheric 
disappearance have a mass well below an MeV 
and are consistent with stable tau neutrinos. An 
inspection of the two dimensional limits on the 
t neutrino mass derived from experimental data 
reveal that the ensemble of limits is too consis- 
tent. I have also for the first time shown that the 
usual assumption that the likelihood is well be- 
haved for this technique is questionable. 1 have 
ended by recommending what extra information 
future limit setters should publish to allow oth- 
ers to gauge the believability of their limits, and 
have also recommended a method more suited to 
obtaining an answer to the question of what neu- 
trino mass is excluded by the observed data than 
the methods used up to now. 
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